This document describes the replication files associated with the paper:
Yoshikuni Ono and Hirofumi Miwa. “Gender Differences in Campaigning Under Alternative Voting Systems: Analysis of Election Manifestos.” Politics, Groups, and Identities.

The following files are contained herein:
0_data_preparation.R (R code to generate data)
1_main_analysis_model1.R (R code to replicate the analysis by Model (1), which is reported in the main text and Online Appendices A, B, and C)
2_main_analysis_model2.R (R code to replicate the analysis by Model (2), which is reported in the main text)
3_RC_no_covariate_model.R (R code to replicate the robustness check using the no-covariate model, which is reported in Online Appendix D)
4_MC_simulation_model2.R (R code to replicate Monte Carlo simulation on the specification of Model (2), which is reported in Online Appendix E)
5_RC_successful_candidates.R (R code to replicate the robustness check for successful candidates, which is reported in Online Appendix F)
6_RC_district_urbanness.R (R code to replicate the robustness check controlling for district urbaneness, which is reported in Online Appendix G)
7_female_candidate_strategy.R (R code to replicate the analysis of female candidate strategy when she faces female opponents, which is reported in Online Appendix H)
DID_2003.csv (the data of district-level densely inhabited district ratio in 2003 retrieved from Taku Sugawara’s personal website, which is not currently available)
name_correction_list.csv (data necessary for combining Amy Catalinac’s document-term matrix and the Reed-Smith Japanese House of Representatives Elections Dataset)

All R files are encoded in UTF-8. Some of them contain Japanese characters and will get garbled when opened with another encoding.

In addition to the above files, you should prepare the following files and locate them in your working directory:
- all.1986-2009.reduced.csv, which is contained in Catalinac (2018)’s replication materials (https://doi.org/10.7910/DVN/PENDX4)
- Reed-Smith-JHRED-CANDIDATES.dta, i.e., the Reed-Smith Japanese House of Representatives Elections Dataset (https://doi.org/10.7910/DVN/QFEPXD)
- 選挙区別集計_03年齢別×男女別人口・外国人人口.csv and 選挙区別集計_12産業別×男女別15歳以上就業者数.csv, which are the data of district-level population census in 2002 and can be retrieved from Akira Nishizawa’s website (https://home.csis.u-tokyo.ac.jp/~nishizawa/senkyoku/). Download senkyoku300.zip from the link named “国勢調査集計データ（csv形式）” below “300選挙区（2002年改訂）” and unzip it, and you can find these CSV files.

First, run 0_data_preparation.R and create dfm_matrix.Rdata. Then, run 1_main_analysis_model1.R and save STM_result_model1_75.Rdata. Each of the other R codes works independently from other codes.

We used quanteda package version 1.3.4 and stm package version 1.3.3. We found that we obtain slightly different results from the original ones reported in the paper if we use other versions of these packages. However, we believe that our substantial conclusions still hold.

If you have any questions or concerns about the files, please contact Hirofumi Miwa.